A flexible front-end for HTS

نویسندگان

  • Matthew P. Aylett
  • Rasmus Dall
  • Arnab Ghoshal
  • Gustav Eje Henter
  • Thomas Merritt
چکیده

Parametric speech synthesis techniques depend on full context acoustic models generated by language front-ends, which analyse linguistic and phonetic structure. HTS, the leading parametric synthesis system, can use a number of different front-ends to generate full context models for synthesis and training. In this paper we explore the use of a new text processing front-end that has been added to the speech recognition toolkit Kaldi as part of an ongoing project to produce a new parametric speech synthesis system, Idlak. The use of XML specification files, a modular design, and modern coding and testing approaches, make the Idlak front-end ideal for adding, altering and experimenting with the contexts used in full context acoustic models. The Idlak front-end was evaluated against the standard Festival front-end in the HTS system. Results from the Idlak front-end compare well with the more mature Festival front-end (Idlak 2.83 MOS vs Festival 2.85 MOS), although a slight reduction in naturalness perceived by non-native English speakers can be attributed to Festival’s insertion of non-punctuated pauses.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multilingual TTS System of Nokia Entry for Blizzard 2010

In Nokia’s blizzard 2010 entry, we built the system with Nokia multilingual text to speech front end system and two high performance HTS backends. This MLTTS front end system describes the design and implementation designed for universal language coverage and a single code execution for them all based on the assumption that there are more features uniting world languages than differentiating them.

متن کامل

Idlak Tangle: An Open Source Kaldi Based Parametric Speech Synthesiser Based on DNN

This paper presents a text to speech (TTS) extension to Kaldi a liberally licensed open source speech recognition system. The system, Idlak Tangle, uses recent deep neural network (DNN) methods for modelling speech, the Idlak XML based text processing system as the front end, and a newly released open source mixed excitation MLSA vocoder included in Idlak. The system has none of the licensing r...

متن کامل

Development of a bycatch reduction device (BRD) for shrimp beam trawl using flexible materials

  This study aimed to design a bycatch reduction device (BRD) for shrimp beam trawl, which is manufactured by flexible materials to reduce bycatch for the gear in the South Sea of Korea. The model test was carried out to understand the shape of the gear in the water and to measure the variation of flow speed due to the BRD in a circulating water channel. Catches were compared between a shrimp b...

متن کامل

Optimization of an HTS Induction/Synchronous Motor According to Changing of HTS Tapes Critical Current by Analytical Hierarchy Process

This paper represents the performance of a squirrel-cage High Temperature Superconducting Induction/ Synchronous Motor (HTS-ISM) based on nonlinear electrical equivalent circuit. The structure of the HTS-ISM is the same as that of the squirrel-cage type induction machine, and the secondary windings are fabricated by the use of the HTS wires. It has already been shown that based on the experimen...

متن کامل

Development of a bycatch reduction device (BRD) for shrimp beam trawl using flexible materials

  This study aimed to design a bycatch reduction device (BRD) for shrimp beam trawl, which is manufactured by flexible materials to reduce bycatch for the gear in the South Sea of Korea. The model test was carried out to understand the shape of the gear in the water and to measure the variation of flow speed due to the BRD in a circulating water channel. Catches were compared between a shrimp b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014